Picture for Minlan Yu

Minlan Yu

Harvard University

ScaleAcross Explorer: Exploring Communication Optimization for Scale-Across AI Model Training

Add code
May 23, 2026
Viaarxiv icon

PALS: Power-Aware LLM Serving for Mixture-of-Experts Models

Add code
May 20, 2026
Viaarxiv icon

From Barrier to Bridge: The Case for AI Data Center/Power Grid Co-Design

Add code
May 04, 2026
Viaarxiv icon

Orla: A Library for Serving LLM-Based Multi-Agent Systems

Add code
Mar 13, 2026
Viaarxiv icon

Confucius Code Agent: Scalable Agent Scaffolding for Real-World Codebases

Add code
Dec 20, 2025
Viaarxiv icon

Towards Easy and Realistic Network Infrastructure Testing for Large-scale Machine Learning

Add code
Apr 29, 2025
Figure 1 for Towards Easy and Realistic Network Infrastructure Testing for Large-scale Machine Learning
Figure 2 for Towards Easy and Realistic Network Infrastructure Testing for Large-scale Machine Learning
Figure 3 for Towards Easy and Realistic Network Infrastructure Testing for Large-scale Machine Learning
Viaarxiv icon

HACK: Homomorphic Acceleration via Compression of the Key-Value Cache for Disaggregated LLM Inference

Add code
Feb 05, 2025
Figure 1 for HACK: Homomorphic Acceleration via Compression of the Key-Value Cache for Disaggregated LLM Inference
Figure 2 for HACK: Homomorphic Acceleration via Compression of the Key-Value Cache for Disaggregated LLM Inference
Figure 3 for HACK: Homomorphic Acceleration via Compression of the Key-Value Cache for Disaggregated LLM Inference
Figure 4 for HACK: Homomorphic Acceleration via Compression of the Key-Value Cache for Disaggregated LLM Inference
Viaarxiv icon

NetFlowGen: Leveraging Generative Pre-training for Network Traffic Dynamics

Add code
Dec 30, 2024
Viaarxiv icon

TrainMover: Efficient ML Training Live Migration with No Memory Overhead

Add code
Dec 17, 2024
Figure 1 for TrainMover: Efficient ML Training Live Migration with No Memory Overhead
Figure 2 for TrainMover: Efficient ML Training Live Migration with No Memory Overhead
Figure 3 for TrainMover: Efficient ML Training Live Migration with No Memory Overhead
Figure 4 for TrainMover: Efficient ML Training Live Migration with No Memory Overhead
Viaarxiv icon

Minder: Faulty Machine Detection for Large-scale Distributed Model Training

Add code
Nov 04, 2024
Figure 1 for Minder: Faulty Machine Detection for Large-scale Distributed Model Training
Figure 2 for Minder: Faulty Machine Detection for Large-scale Distributed Model Training
Figure 3 for Minder: Faulty Machine Detection for Large-scale Distributed Model Training
Figure 4 for Minder: Faulty Machine Detection for Large-scale Distributed Model Training
Viaarxiv icon